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Raw data pertaining to thesis topics in the fields 
of Chemical and Petroleum Imgineering, submitted to 144 
universities and other institutions of higher learning, 
are indexed by keywords-in-context and dispvlayed per- 
mutedly. The printed output reference listing is entered 
by keyword. Serial references to author, school, and 
hardcopy may be read directly. 

Reference listings were prepared using an IBM 7040 
Fixed-word-length Computer. The computer programs ere 


written in Fortran (IV) computer language. 
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Purvose of Investigetion Information accumulation, once 
the very object of man's forme] educetion, is ranidly be- 
coming a giant boulder to be throwm in tre vath of men's 
search for meaningful date. The volume of printed metter 
on virtually any subject ryrows hourly at 2 swift pace end 
men has been slow to grespv this reality. 

This thesis deals with one aspect of informétion hand- 
ling wherein comvuter vrocessing technicues ére utilized 
tO Deriorm cotalogiang, slyphebetizineg, indexing, and print— 
ing of a large volume of hnistoricel] data nertetnine to writ- 
ings in a specialized field of endeavor (ie: Chemical and 
Petre Teun Engineering). It is the author's aim to determine 
whether it is feasible to create a permuted title index with 
an engineering oriented, fixed-word-lenrcth digitel computer. 
If so, such e program will be written. 
Descrivtion of Terms Used "Permuted Title Indexing" and 
"Keyword-in-Context Indexing" (KVIC) are synonomous end 
refer to the vrocedure of shifting sequences of words. 
The imvortent word is nrinted in e designated keyword 
column while retaining its contextual significence 
through reproduction of surrounding words. "Discard words" 


"vevwords." "Discerd 


are the ooverse of imvortant words or 
words" simoly are vassed-by in the indexing procedures. 


Notation used herein for comouter programs end construction 





of computer programs is done in the Fortran (IV) computer 
language. One exception is the PACK Subroutine which is 
written in the MAP computer language, 


Permuted Indexing as a Part of the Information Handling 
Spectrum Information handling = Documentation 

relict retrieval 

Data sscdiiani ied 

ravaneer capa processing 3 
The above equality represents the information hendling 
spectrum. Permuted title indexing is but a small portion 
of information retrieval, and while the ultimate, futur- 
istic information handling systems may well be capable 
of completely automating data-indexing, the current prec- 
ticalities (machine capabilities and economics) point 
toward hard copy index listings rather than machine-stored 
date which may be indexed each time an inquiry is presented 
on the subject matter which is in machine memory. Carried 
one step further, the futuristic system should easily Ye 
able to produce not only indexes, but also abstracts and 
full-text reproductions. While scientists and humanists 
alike should await with great anticipation the coming of 
the automated research era, there remains today volume 
after volume of literature lists which may be indexed 
with the machines of today for the benefit of today's 
researcher. This is the problem to which the author 


addresses himself. 


Permuted title indexing has the great advantage of 





being relatively inexpensive when compared to stored indexing 
procedures. However, the ease of creating permuted title 
indexes is counter-balanced by the lack of any qualitative 
judgements as to the value of the word being indexed in 
relation to its possible importance to the user or re- 
searcher. The only determination made is whether the word) 
is to be kept ( a keyword ) or to be discarded. This de- 
terminetion, while discretionary on the part of the pro- 
grammer, is ‘nonetheless completely quantitative in nature, 
Ori cin of Permuted Title Indexes One of ie earliest 
ventures in machine~generated--indexes was described to 
a meeting of The American Chemical Society in Atlantic 
City, New Jersey, on September 14, 1959, by Mr. Hans 
Peter’ Luhn of tie International Business Machine 
Corporation. 7 It was on this occasion that the word KWIC 
wes added to the computer field's dictionary of buzz words, 
(While often associated with IB”, KWIC is not a trademark 
of that company) In the six years following the intro- 
duction of KWIC by Mr. Luhn, similarly constructed indexes 
have been generated in many segments of industry end ed- 
ucation. Three examples of such applications are: 

Chevron KW1IC Index 5 


Journal of Assoclation for Comput- 
ing Machines 


Kansas Slavic Index iG 


The first is an index of articles published in The Journal 


of Petroleum Technology and The Society of Petroleum Engi- 


” 





neers Magazine of interest to the Chevron 011 Company. 
The second example is an annual recapitulation of all 
articles in a trade publication. The third is in the 
Kansas State University and University of Kansas libraries 
as an ancillary aid to researchers in the field of Slavic 
writings. 

These three examples serve to point uv the wice renge 
of subs ee es which may be served by permuted title indexes. 


While each example is somewhat different in format and 


construction, the techniques employed do not vary widely. 





Chanter II 

CONSIDERATIONS IN ESTABLISHING AN INFORMATION RETRIEVAL 
SYST an : : 

General Requirements Any system to retrieve information 
must be designed to fit the problem at hand. This can’ 
be accomplished in one of two ways. First, the ayaeen 
may be designed from the beginning or secondly, a more 
general application may be modified to fit the problem. 
To attempt to design an information retrieval system which 
will apply to all categories (medicine, business, refining, 
engineering, etc.) is to presume that the end-user will have 
the requirement to know all things about e211 catefories, 
This presumption is falacious with passibly the exception of 
one or two huge libraries which might serve as repositories 
for all the world's written works. 

In view of this, the parameters of any problem in in- 
formation retrieval may be reduced to three: 1) the data 
which is available 2) the processing system or systems 
which may be available to accomplish the task 3) the time 
dimension or urgency of the matter. The data perameter 
encompesses determination of file size (how much data 
should be indexed), conversion to a format which is compat- 
ible with the processing system to be used, assignment of 
weighting factors to distinguish the importance of one piece 
of data in relation to others being indexed (where appro- 
Driawe, and constraints which are to apply to excention 


data (what field of endeavor is free of exceptions!). 





The processing parameter is usually dictated by the 
aveilable machine capability. Of course, a large index- 
ing operation could require purchase or rental of equip- 
ment to perform the specific task at hand. In this case, 
computer hardware configurations having variable-word- 
length features, storage capacity to fit data collection, 
and high-access-speed to storage would play ea large part 
in the selection of a computer to perform the task, The 
parameter of time is simply @& pressure on the decision- 
makers responsible for the information hendling project. 
Ample time allows for optimum deta collection, mechine 
selection, and output preparetion. Stringent time re- 
quirements may adversely affect each element. 

Adcitional Considerations Future exvansion of data files 
imposes the requirement for elasticity in the construction 
of the initial indexing procedure. This is often called 
"open-file" construction. When the possibility exists 
that a particular data collection will grow over én ex- 
tended period and that repeated indexing will be required 
(4.e. monthly, bi-monthly, etc.), system design must pro- 
vide for the merging of new records with old. While much 
of the merging and sorting may be accomvlished by machine, 
there remains the problem of how new data is to be merged- 
whether the criterion will be man-assigned codes or wheth- 
er the comvuter is to be programmed to perform recognition 
chores. The latter is certainly desirable when the data 


may be recognized and stored without inclusion of ea value 





descriptor. However, when relative-judgements are re- 
quired, man is still required to assign the “opinion" 
code. 

The Sepeneast scan method of expediting infor- 
mation retrieval is one step toward the day when it will 
be the function of the library to retrieve information, 
thereby freeing the researcher to spend his time digest- 
ing and evaluating the information in the field under 
study. “Research workers today, in danger of suffocation 
under the weight of published material, should not have to 
do the extra work of discovering what documents are actu- 
ally relevent - searching the literature - unless they 
have some particular reason for wanting to do so." - 

‘ As the trend continues toward automated library ser- 
vice, information retrieval will be greatly facilitated, 
but new problems will arise. Specifically, the economics 
of running a large computer-oriented librery solely for 
research are of a magnitude to provoke extensive debate, 
as the concept is not yet universally accepted. Coupled 
with the economic questions are the very reel cuestions 
of who will operate this advanced system of high sneed 
deta retrieval. The thought of having the average user 
push buttons to operate a system with which the user is 
not totally familiar appeers impractical and leads to the 
conclusion thst trained information retrievel personnel 
will be required to render such an advanced system pro- 


per handling and efficiency of operation. 9 





Implications of Information Retrieval to Military Situ- 


ations "The real expense of information comes in not hav- 
ins it." '° ‘This quote actuelly comes from a trade pub- 
lication describing efforts by the Goodyear Rubber Comvany 
to keep pace with the exvloding volume of technical infor- 
mation pertinent to the rubber industry. However, the 
quote states a truism of military doctrine which over the 
years has been expressed in a variety of patriotic and 
political slogans - 
"To be forewarned is to be forearmed" 
"Know your adversary" 
"In war, the winner spends too much, 
the vanquished spends POO was tien 
Each of these expressions (and many more of similar intent) 
deals with knowledge of the opponent and vresupposes a 
complete knowledge of the sveaker's own internal overa- 
tions. 
Today the "expense" of missing informetion no longer 
can be calculated in terms of missing intelligence or 
poor scouting reports. The electronic, atomic, comput- 
erized Henense) Gata cieiene which this country has ac— 
quired for itself also demands a high degree of information 
gathering in terms of whet inventories are available, where 
they are located, whether parts are interchangeable, what 
is the best delivery path, etc. Thus, the techniques for 
collecting data, indexing, publishing, and distributing 


large numbers of cross-reference manuels to end-users 





around the world find increasing application in the mil- 


itary as well as civilian industry. 





Chapter III 


A PROBLEM IN PERMUTED TITLE INDEXING 


Date <A collection of Chemical Engineering M.S. and Ph.D. 
thesis titles with author nemes, school, and year of pub- 
lication exists in the Denartment of Chemical and Petro- 
leum Engineering. This collection, comprised of 3000 
titles, has been assimilated from 144 universities in the 
United States and Caneda and has been stored on punched 
cards. Each record consists of at least three vunched 
cards and ranges to a maximum of six cards. The for- 
mat of the input data is listed below. 
ist card Serial code (Columns 1-4) 
| Author name (Colurnins 6-38) 
Year of publication (Columns 40-43) 
2nd card: Title (Columns 7-55) 
3rd card Title continued (Columns @&-55) optional 
4th cerd Title continued (Columns &-55) optionel 
Sth card Title continued (Columns 8-55) ontional 
Last cerd School neme (Columns 9-48) 
Appendix B on page61 shows a print-out of sample invut 
date. The University of Kansas computer (IBM 7TO040-16K) 
was utilized to convert the date to printed format and 
to construct a permuted title index. 
Method of Solution The problem is subdivided into six 
general sub-areas for discussion: 
Record processing 


Error detection 





Keyword identification 

Tane loading 

Sorting 

Write-out procedures 
A general flow chart is shown on page 19 as Figure 2, 
Record Processing A set of cards comprising a single 
record is read into the computer. Based on the pre- 
assigned format for each card ina set, the computer stores 
the SETETLA on from each card, formuletes the code (serial 
number of title, yeer of publication, school code number) 
which will be carried forward in each step of the progran, 
searches for words, tests the words found egainst a diction- 
ary of discard words , and records the information on three 
tapes: a master record tape, a keyword tape, and an author 
tape. 

This procedure is repeated until the last cerd, which 
is coded "last card," is read. The main-line computer pro- 
cream is listed in Appendix A, pages 33 through 44, 

Error Detection ie each card is read, its format is com- 
pared with the vre-assigned formets for each cerd in the set, 
and, if an exception exists, the exception card is vrinted 
out with a message indicating whether the error exists in 
author, title, or school card. Corrective action is,then 
taken manually by repunching the date card which has been 
found to be incompatible with the specified format. 

Spelling errors are not detected by this procedure 


and, in fact, are ignored in the preparation of the per- 





ee 


muted index. One excention to the spelJing error rule 
exists when formulating the school code number. A list 

of e11 schools apolicable to the problem is stored in the 
comouter. The school data card inforration is compared 

to this list until an exact comvarison exists. The code 
corresponding to the school, as listed in Apnendix C, page 
62 , is then assigned. Should an exact comvarison not exist, 
an error message will be vrinted showing the school data. 
card which is incompatible with Mie wec HOGHING LC tl Onatay 
file. If the school simvly is not listed, an edditional 
entry must be added to tke school dictionery. If the 
school data card is misspelled, repunching is oe 
Keyword Identification When a record hes been determined 
to be in proper format, the title cards are searched cher- 
acter by character. Any combination of letters with a 
blank character at both ends of the combination is con- 
sidered a possible keyword which is then compared to the 
words in the discard dictionary. If the combination of 
characters being tested corresponds with a word in the 
discard dictionary, no further ee taken ondepne 
orogrem proceeds to locate the next succeeding groun of 
characters in the title cards. If >the combination of 
‘characters does not coincide with any discard word, a 
keyword is determined and is stored on the tape contéein- 
ins keywords, location of the first character of the key- 
word in relation to the full title, and the serial code. 
This procedure is repeated on each of the four title 


data arrays until two cheracters in succession are deter- 





mined to be blank. Thus, if 2 comnlete title exists on 

a single invout data card, the program will sense the end 
of the first card when 1) the column count reaches the 
limit prescribed for the card being tested or 2) two 
successive blenks are determined. At this point, the 
program will check in succession the first column of 

eech of the remaining title data arrays to determine the 
existence or non-existence of additional titie information. 
If non-existence is determined in eech remaining arrey, 
end of title is confirmed and the program vroceeds to the 
next element. 

Tape Loading Three taves are loaded with data to be used 
in printing outnut listings. First, a master bibliographic 
tepe is prevered which contains serial code, full title, 
and author as a single entry. Second, a keyword tape 

is prepared. Each keyword which is determined is loaded 
as a single entry on this tape. Serial code and keyword 
locetion are carried along as mentioned above.  lrirdyacn 
euthor tape is prepared consisting of serial code and 
author name. | 

Sorting The master bibliographic tave is loaded in order 
by serial code and no sorting is performed on this teve. 
The keyword tave is sorted alvhabetically using the IBM 


"l for the 7040 Computer. This 


Generalized Sort System 
sort program is shown in Avvnendix A, pese 59. The author 
tepe was not sorted due to the fact thet seriel codes are 


assigned alphabetically by euthor neme. However, if the 





file were increased end serial codes were assivned on a 

basis other than alpvhabetic-author, this tave is so con- 
structed that it may be sorted by the same sorting vro- 

gram that is used to sort the keyword teve. 

write-out Procedures Seperate write-out pnrorrems for each 
outnut teve are listed in Annendix A on pages 50, 52, and 
'66- The vrograms for the master bibliogrephic tape and auth- 
or tape are straightforward and ‘involve only reed-write 
overations within the comvuter. 

The progrem for writing the KWIC index requires both 
the sorted keyword tape end master bibliorranhic teve eas 
innut data to the computer. The keyword tave is reed, the 
master tepe is searched to find the bibliographic listing 
corresponding to the code apnlicable to the keyword being 
listed, the title is positioned in a holding array within 
the computer by the keyword Jocation, and the permuted 
title is then Wien oli 

When a keyword is Jocated near the end of a lentthy 
title, much of the meaning of the title may be lost due to 
truncation of the title. In this event, vrovisions ere 
included in the write-out program for the keyword tave 
Tom whaAp=around the first forty characters of the title. 
Descrivtion of Programs The computer instructions for the 
vroblem are divided into five independent segments: 


1- Main vrogrem to orocess date and loed t*vnes 
(Appendix A, pege 2%) 


2e- Sorting instructions 
(Appendix A, page 59) 





3- Write-out of master bibliographic list 
(Appendix A, vage 50) 


4- Write-out of vnermuted title index 
(Aopendix A, pare 52) 


5- Write-out of author list 
(Appendix A, vage 56) 


The Main combuter progrem has the basic functions of 
checking and converting. Checking: Each data card must 
be checked for correctness of the format, end notice of 
error must be returned to the programmer by a computer 
generated error message. Convertins: Once the deta hes 
been transferred from the storage medium of punched cards 
to the medium of electronic core storage, the vhysical 
restrictions of record size (i.e. eighty columns of 
vunched data per cerd) are removed, and records of greater 
length than eighty characters may be established in the 
computer. As keywords are identified, the informetion is 
transferred from one storage medium (core) to enother 
(tape). Figure T at the tov of the next page describes 
the final format of the data as it is transferred from 
the computer end stored on tepe. 

Four subvrogrems are utilized in connection with the 
mein program: 

“4- The NBRT function Ssubprogrem is used to de- 
termine whether a cheracter, being tested is 
numeric or non-numeric. The answer returned 
to the main program is either true or felse. 
(True if character under test is e number; 


false if not e number). 
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2~ The PACK subvrogrem, whick is written in 
the Macro Assembly Progrem (MAP), is used 
to speed-up the ieee! search vrocedure 
which determines school-code numbers. this 
subprogram condenses or PACKS characters 
which ere stored one character per comouter 
word to eae maximum of six charecters per 
computer word. Thus, the school-datea- 
card information which is read into the 
computer ona cherecter by cheracter basis 
may be converted to allow faster comparison 


to school dictionery which is read and stored 





Sex aluhabetic hele tors ner computer word. 

3- The KSKOL function Subprogram is used in 
the formulation of tre serial-year-school 
code number. The program searches the school 
dictionary and returns the school code num- 
ber to the main in 

4—- The KICK subroutine subvrogram verforms the 
decision-making function of whether a groun 
of charecters under test constitutes a discard 
word or a Keyword. If the groun of cherecters 
being tested does not corresnond exactly to 
any word in the discard dictionary, the word 
is returned to the main program for continued 
processing. Otherwise, @ zero signal is re- 
turned to the main program to initiate search 
for the next ErouD OF CHharee ters. 

The second segment of computer instructions is the 


oorting instructions. The tave containing the informetion 





& 


to be sorted is mounted on the tape unit specified by the 
proszrammer, and the control cards for the IBM Generalized 
Sort System are read into the computer. When sorting is 
complete, the sorted tane will be loceted on the tape-drive 
assembly selected by the computer - the identificetion of 
which will be listed on the computer console typewriter 
me eeacs 

The third, fourth, end fifth segments of the instruct- 


ions are programs to list or print-out the information 
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stored on the three tapes involved. Each write-out pnrogram 
reauires its corresponding tave to be mounted on a speci- 
fied tepe drive assembly as input data to the write-out 
program. The write-out vrogrem for the permuted title 


index also requires the master bibliogYravhic tepne as in- 


put data as mentioned above. 
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Chapter IV 


ANALYSIS OF PROCEDURES AND COMPUTER TECHNIQUES 


The key element in creating 2a pnermuted title index 
is the keyword file. This file must be cenable of being 
sorted and must contain the applicable mester title or 
reference code to facilitate search for the master title. 
The oreferable method of obteining titles is tomiimedade 
them in the keyword records. In this manner, titles are 
"carried along" es keywords are sorted. This orocedure 
allows direct write-out of the sorted tave and obviates 
the need to search the mester record file. This pvro- 
cedure was attempted in the solution of this problem and 
was successfully used on a test data deck consisting of 
sixty sets of data. However, comvlications arose when 
the full three thousand record sets were processed. The key- 
word tape file required six tape reels to store all key- 
words found plus the apvlicable master record. This 
fact rendered the ce to the vroblem impossible with 
the computer equipment at hend because the generalized 
Sort system for the University of Kansas computer will 
sort only to a maximum of three reels of tave. 

Alternate Strategies The need to reduce the size of the 
keyword file being avparent, it was decided to eliminate 
the master listing carried along with the eed and 

serial code. This change allowed a reduction in record 
length from 246 words ver record to 29 words ner record. 


This change, in reducing the keyword tape by a factor 
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of eight, did circumvent the physical limitation imvosed 
on the preferred solution. But, and this is an important 
but, the solution became dependent on time consuming tepe- 
search procedures, 

Another alternative, which was not tried but which now 
apvears most sensible, would be to compress the keyword 
file by a factor of six. This could be accomplished 
through the use of the PACK subprogram (page 58) which 
is used in the school-code determination program. By 
condensing the keyword file, slight modification would 
be required to be mede to field specifications in the 
SORT control cards, but this would pose no problem, 

Once the keyword tape was sorted, write-out procedures 
could be executed with one small addition to the write- 
out program, The compressed words on the keyword tape 
must be de-compressed or UNPACKED in order that character 
by character control may be exercised in the positioning 
of the vermuted title. The unvackinge oneration could be 
performed through the use of the subprogrem UNPACK which 
is listed in Appendix A, vage 58 , 


Utilization of Comvouter Time Because of the need to use 





the teve storage media rather than internal comvuter mem- 
By to hold the master file, keyword file, end ao file, 
much of the advantage gained in using a high speed comnuter 
is blunted by a repeated tape-READ and tape-WRITE procedures 
which are inherently slower operations than the movement 


of data within core memory. The program presented with 
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this paper essentially deals with one record at a time, and 
no analysis is made to determine the optimum use of core 
storage to sveed the computer oneretions. The matter 

of computer time usage is an economic one which must be 
taken into consideration in weighing the cost of fenerating 
permuted titie indexes versus the exvected gain in research. 
DroOaduc vac, . 

Some methods ro optimizing the use of computer time 
are discussed below as apynlied to the problem et hend. 
First, end most importantly, the strategy of tave-search- 
ing must be avoided if vossible because each increase in 
file size requires greater time to locate each title record. 


The need to "carry alons" 


the title eas keywords are sorted 
is of paramount importance when using a Siie-orkenves sys- 
tem. Disk storage, with its high access speed, would 
lessen the dilemma if sufficient disk svace were avail- 
able to hold the complete files stored on tape in the teane- 
oriented system. 
If 211 evenues of escape — closed and teve-search 
procedures remain as the only solution to the problem, 
much time may still be saved by incorporating a cuel meester 
file. This vrocedure involves duplicate master file tapes. 
The program is written to seeérch each tave on an alternating 
besis so that when one search is complete, and the tepe is 
rewinding, the program may procecd to search the alternate 
tepe without waiting for the comvletion of rewind procedures. 
Secondly, the keyword file itself’may be reduced in 


size by increasing the size of the discard dictionery. 
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This solution will reduce sort and write-out time, but it 


will renuire ‘longer running time in loading the tanes. 


The fifteen common words, in order of occurrence, which 


are listed below account for 30% of all words in titles. re 
Parone | 6. by 112 Wo 
2. the ts. & 12. some 
3. and 8. waethi toe sas 
4. in 9 sperm ioeat 
Deon 1Oge" Pron 15. an 


These fifteen words are the work horses of pronaeeera 
Gictionary. The incorpvoration of adsitional discerd words 
in the dictionary provides for neater output listing, but 
the increase requires much additional computer running 
time. On the test data, increasing the discard dictionary 
from fifteen to thirty words ceused running time to increase 
in approximately the same ratio (1-2). 

Thus, in designing an optimum system, some balence 
must be reached between increases in discard dictionery 
size and elimination of low-value words in the nernuted 
index. The list of discard words for this problem is 
shown in Apvendix A, page 60. 

The third area of discussion vertains to READ-WRITE 
operations. At the comnletion of vrorrem de-bugring, Boe 
maining unused core space should be divided into two hold- 
ing arrays. This would allow READ-in of more then one date 
set ver READ command, and more than one WRITE-output per 
WRITE commend. By reducing the number of referrals to 


peripheral equipment from the mein computer, additional 





24 


time would be conserved. 

Interpretation of Output Appendix C, fold-out pere 62, 
Shows samples of three elements of the permuted title 
index. On the left is the school code dietionery. Bide 
center section shows the permuted index, end the right 
hend section, the master bibliographic listing. The school 
Gictionery is in numerical order; the permuted index is in 
alphabetical order by keyword; end the master Jisting is 
in order of serial code number. To obtain Sie ssinevets 
title which pertains to a keyword of interest to the re- 
searcher, the serial code serves as reference to enter, 
the master listing. The school of publication may be read 
directly using the school code number which is part of the 


serial-year-school code number, 





Chapter V 


CONCLUSION 


Two questions were posed at the outset of this prob- 
lems: 

i- Did a feasible solution to the problem of per- 
muted title indexing exist when a fixed-word- 
length computer wes used rather than the 
apparentiy more logical variable-word-length 
computer? 

2- Given thet a fixed-word-length computer would 
be used in the solution of the vroblem in per- 
muted title indexing, did a feasible solution 
exist with the use of the engcineering-ori- 
ented computer lanseuage FORTRAN? 

The answers to these two ouestions are in the affirmetive, 
but with some qualifications. 
summation of Findings 

1- Permuted title index problems may be solved 
using fixed-word-length computers. 

2- FORTRAN computer language is adequate to per- 
form the tasks involved in creating 2 ver- 
muted index. Variable format cepebility, 
wnile not a mandatory requirement, would 
have simplified the WRITE portions of the 
overall vroframn. 

3- Discard dictionary size should be kent as 


small as practicable, 
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4~ Keyword files which will be sorted by key- 
werdwsrould contaim full tattle if otea)] 
possible. The vrocedure of tene-search- 
ing should be avoided when other solutions 
are available, 
Projections of the Future’ This project involved the classic 
problems of creating 2 vermuted title index. As mechine 
cépabilities increase, eas nrogramming languarses iIncorno- 
rate new concepts avnlicable to information retrieval, 
and as the need for more deteiled research tools become 
aovarent, it will be necessary to improve the permuted 
index from a simple alphabetic listing to a more de-. 
tailed listing involving heirarchies of information. 
This will reouire esteblishment of new and different 
CrimermcentOr SOrting. 

This tyve of informetion retrieval tool hes some of 
its most promising applicetions in the field of engineering, 
both prectical and theoretical. 

The question remains whetnerienctineers will” provide 
their own research tools or wait eee librarians to 
Go it for them. 


Food for Thought "When computers have been programmed to 





nose out more new mathematical proofs, when one-pack 
vetiences have been mechenically vleyed to stimulate 

the nature of a control system, when a comvuter is used, 

but reliably (as it will be), to paint pictures end write 
poems, and eouinped with a machine-sized theseurus to treans- 


late and therefore comparatively to identify differences of 
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context in metaphysical and theological statements in 
different languages, when all this hanvens, will the pro- 
gerammer, the analytical wielder of this new mathemetical 


paintbrush, be an artist, or will he be a scientist?” 13 
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FLOW CHART FOR PROGRAN TO WRITE-OUT 
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TAPE AQ AND PERMUTED -TITLE-- LNDEX 


READ 
L A/O 


BLK = £ (37) 


wo YES 


FEWIND WVTAPER 


IT sIsave , /50 


NCL (Z,7) 
= BLINK 






BFEWIID TAPE L. 









ao 


T-LTsaP, 42 







wo YES 


WRITE(L). WE. BLK 






YES 





NO M/MITE (4), £4. B48 
-OR. 


Te D.E@ Diy 





NO 


LOL = w7alasy) [FremFe2 





Soe NRirE (LZ) = 
WRITE (I) wri(z + 12) 
=- BLANK 

SL = ree 
LLOC = LOCr2E0 
ZT=2l,2oo 
WAITE (I +200 
NTL (L412) 
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27 FURMAT (5X, 29al, x 6X, 29/ coy ore 29A1) 
Ze CONS INCE 
Tite eeficcd te hee oilatGe 711) GO TO 3C 
Ger 29 ee ee 
BC COMTINUE 
RERENCG NAPE? 


a RR RATES laa Ag Ep A — S-  N e 


a ces ee me cre ce ce me em ee cm cm ce ce re me ee ne mee me ee ee ee ee ee es ee es ee ee ee ee ee _—_——_— — — on oe 


SS 


a ee em ee ee ew - ye ee ee ee ee ee ee ee —— — = 


== = =D =D == 


a a te ee RE CR A RD > a ae my em ge cp me ee ee 


PAUSE 40 
REWINC NTAPEL 
RCUTINE FCR WRITING KwIC LIST 
WRITE (6,12) 
12 FORMAT (1H1,40X,20HPERMUTEC TITTLE «INDEX, //7/7) 
WRITE (6,12) 
12 FCRMAT (41X, 7HKEYnWGRO, 72X, LIHCONDE-YR-SCH, /) 
15 READ (NATAPE2) (NT2(J)», J=122S) 
IF ((NT201)-EQ.-LL) ~AND.T NIT 21270606 DY) SAND. (NT2(3) cE ire Le} ) 
lL = GU" Te. 20 


—_—— SS oe oo — ~~ — ee ee eee — 


——— so 


a)! CCRTINGE 





a ee FUMES UIC CTS) TR be 0 
SEGGRCE STATEMENT Se 


READ (ATAPEL) (NTIS), J=1,246) 3 
IFN 2 26NwcoelibemleaeeGe eile i$o© = . . «ts... 
[F CONTLC1) SEQeLL) AND. (NTI(2) -EQ-LL) -AND-(NTI(3).0C.LLIIGO IC 20 
_ Cee TGs] ss 
42 CCNTINLE 
IF (NT2(27).NE-NT1(02)) GO TO 41 
IF (NT2(28).NE-NT1(3)) GO TO 42 
IF (NT2(29).NE.NTI(4)) GO TO 41 
Ge TC 45 
45 CCATIANUE oo: ee 


ate = am ewe me ee meee eee wee ee eee ree eee ee wee we ee ee 


—— ee ee ee — om wow = oe 


LCC = NT2(25) Sn 
C LCC IS RELATIVE LOCATICN OF KW BETWEEN L AND 2CO 
: 5 eC ae 
S1 NRITE (1) = RLNK 
(CC SC OTIe Oe ae 
Be 52... 1 =“1,200 | 
EPeRRITE A be2CO}e="NITOL IZ) 
Ll = LLOC-40 
P= (Ute - 
wRAP ARCLND PCRTICN 
| LLl = Ll #1 
| LTEMP = L2 - 4C 3 
| ee em een aes eae 
IF (NRITEC(I).NE. BLNK) GC TC 88 
“@C CCNTINUE ~ TA 
IF ((NRITE(L1).EQ.BLNK) OR. (NRITE(LL1).EO.BLNK)) GO TO 88 


—— ee ee ee ee es ee oe ee a ee — a eed ey ee cee ee ee ee ee ee 
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ee mee es ee me me ee ee ee ee ee ee me oe ee ee ee ee oe ee ec ee ee we cm ee we we we we + ee ee ee ee et ee ee ee ee ee 


7 II = 1 
ie Oo eles LEENP «9 LZ 
‘ada MR oe a a eS a 
Il = TIi+1 
gene CCNTINUE.- J — a 
eo CGNTINUE 


7c ---- 
PRI TEMG SAbINRITECL )s) [Sly C2 eee te) 
ee CRM A Lee (TX. Tlcheiic:  biageer hil 0 > | )pemne gett 00 0) 00s un 
REWIND NTAPE1 





, Ge TC 25 oe a_i 
20. CONTINUE 
RE WING NYAPEQ (000 5 eee i eee 
CALL EXIT | 
END gus ee | me es ga es SS See Sed 
| —_— == — ae 
' 
| Spf ee eee eee ee re a e 
} 
| 
' 
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Appendix A (continued) 


FLOW CHART FOR PROGRAM TO WRITE 
TAPE 3 —~ FTHE AUTHOR LISTING 


READ 
L NTAPES = 4 


PEWIUD A/TAPES 


Bix = £(37) 


LL > Z£(36) 


YES | | 


CALL EXIT 


-OuTr 
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$1 BFTC RITE 


SOURCE STATEMENT 


NODECK 


C PROGRAM TO WRITE NTAPES 3 


C ROUTINE FOR AUTHOR LISTING 


31 


32 
Se. 


36 


40 


1 


DIMENSION NT1(246), 


DIMENSTON L(47) 


READ (551) L 
FORMAT (47A1) 
INTEGER BLNK 
NTAPE3 = 4 
REWIND NTAPE3 
BLNK = L(37) 


WRITE (6,31) 

FORMAT (1H1, 

WRITE (6,532) 

FORMAT (10X, 

READ (NTAPE3) 
tL = L(36) 


40Xy 


NT2(246) 5 


51 


NT3(68), NRITE(60C) 


a —— ee a ee 


L4HAUTHOR LISTING, ////7/) 


6HAUTHOR, 
(NT3(05),» 


4O0X,» GHCODE, / ) | 


J= 1,68) 


— + —+s = 


° 


IF ((NT3(¢1) -EQeLL) eAND.(NT3(2) -EQeLL) sANDs(NT3(3)EQeLL)) _ 


GO TO 40 
WRITE (6,36) 
FORMAT (10X, 
GO TO 35 
CONTINUE 
REWIND NTAPE3 
CALL EXIT 
ENO 


(NT30T)s1T=35967), 


33Al> 


13Xy 


BALy,A3s, 


(NT31 J) e JE21929)5 (NT3IK)»KE1920) 
15X,  20A1) 


wee em me ee mee em ce ce ee cme me me ee ew ee ee ee ee ee ee we ee ee 


Fe te ae oe Se Ne mm emt remy em eR I 
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Oe a et ee ee ee ee ee ee ee ee ee ee ee ee ewe ee ee ee ee ee ee ee ee ee ee ee ee ee ee eee ee ee ee ee ee ——— ow ee a 


ee ee me ay ee ee ee ee ee ee eee es ee es es ee ee ee re es so ms ee ee ee ee es 


ce ee ee ee ee a Oe ee eee ee ee O68 ee ee ee ee ee ee ewe ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee 


ee ee — ae ae ee ee ee ee ee me ee ee ee ee ee ee oe —_— = = ~ 


Se ee ee ee ee ee ee ee ee ee eee ee ee - ee ee ee ee ee ee ae ee ee 
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APPENDIX A (continued) = a z 
r : C SUBROUTINE PACK . ae ee ee oe —— ae — — oe 
CC —=—= 
_—— CALL “PAG (NP Keeper] egestas t dry" Go rt ee 
a 
oe --$TBMAP PAK —--- NODECK 9-9 nn neeene eset nee aan cess nnn ne ---- 
ENTRY PACK 
- - PACK—*" TRA Wee - soeeeee saa Bape oan ae anne anaes $----22-- = - 
SXA GEORGE 94 | 
a | A 2 
PCS* 39490 
—=- —e ae A CME — 2 gt 9 OE ae sr eee 
PCS* 49450 
—— — -— =. . —t a wns SAC* eave ny ant oe 2 9 4 9 1 = —— een ene ae en eee oe es aS me ap oem a ees ey me me en a ee ee es es ee ee ee a a ee ——— i 
PCS* 53450 
a a ce 29492- in aren ap aeaeoa mens = 
PCS* 69450 
-——-—— -- ) —==-=- SAC -----2. 9 49 3 -------- ------  o - ---- - - eee $ e- - +--+ ----- 
PCS* 19490 ; 
Ba an a GACH H-----2 py ------ aoa --nn-~ +--+ ---------------- --- $= - == +--+ == === = 5 = +--+ ---- 
P'CS* 894590 
a cemrn SCD  eCnme 
GEORGE AXT HH yG 
_— oo eee) RR AY ~~ -— ee CK - <9" age aOR Ga CS ee ee 
END 
— ..--""" @ SUBROUTTPNE-UNPACK (722m ie pre 
a 
> CALL UNPACK (NPAK ye -NiloN29N3 9NGsNS9NE—}-——— 
Gee 
=== BMA P= UNPAK~~-— NODE CK nnn on 
ENTRY UNPACK 
-— UNPAG@K- TRA —~"=7q 8 Oe * -— ~~ nner ro Se ee s 
: SXA GEORGE 94 . 
a EC 8 29 
PCS*., 29490 
ee SAC = Sint | On ee ee ee 
Pesos 29491 
on Cee 4949Q-- mr tril r? Ang or ne ae "ahs 
PCS* 29492 | 
SAC* 5 9 ye 
PCS* 29493 
SAC# ~~~ Ory Grp Qame nn ee Sg a ee ee 
PCS* 29494 
SAC* 79450 wa eee senna cc 
Pcs# 29495 
—— SAC *¥——-8 9450 ae - ee - 
| GEORGE AXT HG 
|s | TRA UNPACK  - ee = 





59 
APPENDIX A (continued) 


CONTROL CARDS FOR SORT 


$JOB 0694 »DRURY ’ 
$PAUSE 20 

$TIME 010 

$IBSRT NOTYPE 


PFILEsINPUT/2» REELS/13 MODE/Bs BLOCKSIZE/030s UNLOAD 

FILEsOUTPUT»s MODE/Bs BLOCKSIZE/030>% UNLOAD 

RECORD» TYPE/Fs LENGTH/0309 FIELD/(6936Bs36Bs36B 936B936Bs36B>» 
1 36B936B »36B936B i?) 

SORTs FILE/2s SEQ/Cs GRDERJZs)) FIELD (2539455365 /98s9510s11) 

SYSTEMs INPUTCHANNEL/SeSU04s OUTPUT CHANNEL/A»s MERGE CHANNEL/A 

OPTION » NODUMP»s NOCKPT 

END 
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APPENDIX A (continued) 


DISCARD DICTIONARY 


FOR 

— ae ous =a ee 
ie: 

“Tae © , 
AS 


——wee wa tee oe oan - ee ae a nS 
— ee —_— i a - 


ee aw © cee, ~— = ow oe =e eee ee ee meee ee wee we we we we a + 


WN re 6s): 
RETWEEN 
peo Sy EXPER Maes 7 
PHENOMENA 
a). “= PRODUC ee seee 
PTO 
Bs 
RATE 
DO0— ss: : 
ees 
aL eS YS =e 
ARISING 
EE et ee 
Lae 
ae. Oe ee ara AFD ‘ a 
APPLIED 
=e UND Gar 
. STZE 
~~ S10b lee 
os PLANT 








Appendix B 
SAMPLE INPUT DATA 


— 7 mo - —<— oe oe SO am 46 oe oe —— ms ee ee eee oe ok ee 


A042 AMBROSEs TOMMY We 1957 
LOCAL SHELL-SIDE HEAT TRANSFER COEFFICIENTS IN 
BAFFLED TUBULAR HEAT EXCHANGERS 
OREGON STATE COLLEGE _ =e ADA bo, <S, c wf a. 
A043 AMDUR» Ec 1944 
THE DEHYDRATION OF VEGETABLES 
UNIVERSITY OF MINNESOTA 


A044 AMERIGOs FRANK TESI 1943 
KINETIC STUDIES OF THE ADSORPTION OF PHENOL BY | 
ACTIVATED CARBON IN A PACKED TOWER 

UNIVERSITY OF PITTSBURGH 


_-——_ — ee ee ee we es ew = = ee ee ee ee 


AO45 AMICKs Eo Hes JRe 1939 
PRESSURE-VOLUME-TEMPERATURE RELATIONSHIPS OF A 
METHANE~-ISOPENTANE MIXTURE 
'YALE UNIVERSITY 


A046 AMIR-=YEGAHEHs Ac 1962 7 
ON DETERMINATION OF VAPOR-LIQUID EQUILIBRIUM 
PHASE DISTRIBUTION RATIOS OF PETROLEUM OILS 


= ented rr =P apn ten met em 


OKLAHOMA STATE UNIVERSITY 
AO4T ANDERSENs Le Be | em ee 1956 ee 
ABSORPTION-OXIDATION PROCESSES IN DISPERSED MEDIA 


ee eee comme es emt ete cet eet er et ee te me ee ee es ee ee et et pees ee ee ee ee ee ee ee we ee 


UNIVERSITY OF ILLINOIS 


a a eS a an Sp — slp tM SIR I 


“A048 ANDERSONs De Ges JRe 1940 


VOID VOLUME AND FLOW RESISTANCE OF BEDS OF _ 
PARTICLES 
COLUMBIA UNIVERSITY 


- - a —— ae cc re ee cm cr ec ee we es ee ee ee we ee ee ee ee ec ee ee ee ee ee ee we we ee = =; 


‘ 
ee cm we ee i ee 


A049 ANDERSON» DONALD KEITH eo 


MUTUAL DIFFUSION IN ASSOCIATED BINARY LIQUID. : 
MIXTURES 


> UNIVERST IY OF WA&SRIUNG TO NNN tmnt 0 co ueutGG 0] GG Gcec cau oa 
ROSUMMNDERSONS Fe Aw G0 neue 000 ne ROG ack :~:CO ee 
STUDY OF METHODS FOR PROCESSING TALL OIL 


=———— RR mm me tt 


LOUISIANA STATE UNIVERSITY 


—- ee es ee ee ee ee ee ee ee ee 2 te SF am a ee ae ee ee ce ce ee ee ee ee ee ie ee 


AO51 ANDERSON»s Je Eo | 1953 5; 
EVALUATION OF CATALYSTS FOR. HYDROCARBON  ———“‘(‘(‘iCCCtCt™; 
OXIDATION 
IOWA STATE UNIVERSITY i a) 
AO52 ANDERSON» Je Ee a (1955 es lit” 
MASS TRANSFER IN A DISTILLATION COLUMN 
MASSACHUSETTS INSTITUTE OF TECHNOLOGY eh. 3 
A053 ANDERSONs Soe 1950 


PREPARATION OF NITROGENOUS FERTILIZERS BY ~ "re 
TREATMENT OF CELLULOSIC MATERIALS WITH AMMONIA | 
IOWA STATE UNIVERSITY ts ot ae a menos 
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